Improving Cluster Management with Scalable Filesystems
Authors
Abstract
Reducing the complexity of the hardware and software components of Linux cluster systems can significantly improve management infrastructure scalability. Moving parts, in particular hard drives, generate excess heat and have the highest failure rates among cluster node components. The use of diskless nodes simplifies deployment and management, improves overall system reliability, and reduces operational costs. Previous diskless node implementations have relied on a central server exporting node images using a high-level protocol such as NFS or have employed virtual disks and a block protocol such as iSCSI to remotely store the root filesystem. We present a mechanism to provide the root filesystems of diskless computation nodes using the Lustre high-performance cluster file system. In addition to eliminating the downtime caused by disk failures, this architecture allows for highly scalable I/O performance that can be free from the single point of failure of a central fileserver. We evaluate our management architecture using a small cluster of diskless computation nodes and extrapolate from our results the ability to provide the manageability, scalability, performance and reliability required by current and future cluster designs.
Similar resources
Development of a Burst Buffer System for Data-Intensive Applications
Modern parallel filesystems such as Lustre are designed to provide high, scalable I/O bandwidth in response to growing I/O requirements; however, the bursty I/O characteristics of many data-intensive scientific applications make it difficult for back-end parallel filesystems to efficiently handle I/O requests. A burst buffer system, through which data can be temporarily buffered via high-perfor...
LogFS - finally a scalable flash file system
Currently, two different solutions for filesystems on flash storage exist: Flash Translation Layers in combination with traditional filesystems and log-structured flash file systems. This paper analyzes the weaknesses of both approaches and derives a number of requirements that help avoid these weaknesses. Finally the basic concept of a tree structured flash file system fulfilling all of the...
Cloud Analytics: Do We Really Need to Reinvent the Storage Stack?
Cloud computing offers a powerful abstraction that provides a scalable, virtualized infrastructure as a service where the complexity of fine-grained resource management is hidden from the end-user. Running data analytics applications in the cloud on extremely large data sets is gaining traction as the underlying infrastructure can meet the extreme demands of scalability. Typically, these applic...
Khnum - A Scalable Rapid Application Deployment System for Dynamic Hosting Infrastructures
In a dynamically scalable hosting infrastructure for e-business computing, servers need to be quickly allocated in order to satisfy a sudden demand for increased computing power for a hosted site. Khnum is the applications and data management component of Océano, a dynamically scalable hosting infrastructure for e-business computing utilities. It is responsible for server reconfiguration and for...
Interactive exploration of large filesystems
Secure management of file systems of large organizations can present significant challenges to system administrators, in terms of the number of users, shared access to parts of the file system for supporting large software projects, and securing and monitoring critical parts of the file system from intruders. We present interactive visualization tools for monitoring and viewing the complex acce...